# Multimodal Scoring
Audiobox Aesthetics
A unified model for automatic quality assessment of speech, music, and sound.
Audio Classification
Safetensors
facebook
56.27k
24
Uiclip Jitteredwebsites 2 224 Paraphrased Webpairs Humanpairs
MIT
UIClip is a model designed to quantify the design quality and relevance of user interface (UI) screenshots based on given text descriptions.
Multimodal Fusion
Transformers

biglab
232
0
Prometheus Vision 13b V1.0
Apache-2.0
The first open-source vision-language model developed specifically for evaluation tasks, demonstrating high correlation with both GPT-4V and human evaluators.
Image-to-Text
Transformers, English

prometheus-eval
121
12